10/587482 

WO 2005/073668 PCT/SE2005/000100 

1 1AP20 Rec'd PCT/PTO 27 JUL 200$ 

A METHOD FOR MEASURING THE POSITION OF A MARK IN A DEFLECTOR 
SYSTEM 

Technical field 

The present invention relates to a method for determining the 
5 coordinates of an arbitrarily shaped pattern on a surface in a 
deflector system, as defined in claim 1 and 10. The invention 
also relates to software implementing the method for 
determining the coordinates of an arbitrarily shaped pattern 
on a surface in a deflector system, as defined in claim 14 . 

10 Background to the invention 

The method used for measuring time in a deflector system has 
been used many years. Almost no modifications in the algorithm 
have been done so far. Only the pattern used for different 
kinds of calibrations has been modified during the years. 
15 Today we have an experimental verified repeatability of the 

method in the range of 10-15 nm over a surface of 800x800 mm. 
The 10-15 nm means here the measurement overlay. 

One drawback of the method used is that we so far only can 
measure in the same direction as the micro sweep. In order to 
20 measure an X-coordinate we therefore must use special patterns 
containing 45 -degree bars. 

The method according to prior art is briefly described, since 
it is important to understand the present invention. 

It is difficult to measure time with high accuracy. If, for 
25 example, you want to measure a pulse with the resolution of 1 
nanosecond (ns) you need a measurement clock with the 
frequency of 1 GHz if classical frequency measurement methods 
are used. In the described prior art system, there is no need 
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to measure a single shot of a pulse. The use a scanning beam 
while measuring will get several one -dimensional images of a 
bar or several bars, as an example. Only the "average" 
position of an edge or the CD of a bar is interesting. The 
5 measurement system will only give an average result together 
with its sigma. It is important to remember that the 
measurement system is good enough if this sigma is lower that 
the natural noise in the system. This natural noise can be 
summarized to be laser noise, electronically noise and 

10 mechanical noise. The noise from the measurement system itself 
can be calculated theoretically or verified in practice with a 
known reference signal. It is also possible to get a figure of 
the measurement system noise by simulation. The measurement of 
the position of the bar or the CD will therefore contain the 

15 error: 

Em*** = ^( Err0r naturalY +{ Err ° r measurement) 1 

When we measure time we use a so-called random phase method. 
What this means is that the measurement unit it-self is 
completely un-correlated in phase to the signal we want to 
20 measure. Due to the fact that the signal phase is random 

relative the measurement clock phase we can use a measurement 
clock frequency that is much lower and use an "averaging" 
effect instead to achieve the accuracy. 

In figure 1 the measurement clock phase is shown relative the 
25 reference signal (SOS) . Please note that the input signal (the 
bar) is synchronized with the reference signal since it is 
generated from the micro sweep itself. The upper row of clocks 
in figure 1 is the ruler marked in measuring clock increments. 
What we are after is wher e the positive going edge 10 of the 
30 input signal is relative our reference signal. Of course we 
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also are interested of the negative going edge 11. But the 
same method may be used to find the position of any edge. 

Let us call the period time of the measurement clock tm. Since 
the input signal is a result from the micro sweep we also know 
5 exactly the relationship between the pixel clock period in 
time and what that corresponds to in nanometers. Here we 
introduce tp for the pixel clock period in nanoseconds. We 
also call the pixel clock period in nanometers for pp. The 
scaling expression can therefore be expressed as: 

x ppinm) , x 
1 0 pm(nm) = tm(ns) 

tp(ns) 

pm is what each measurement clock period corresponds to in 
nanometers. From figure 1 we can see that the approximate 
position of the first edge, denoted 10, is 8 pixel clocks. 
Please note that by doing only one measurement i.e. using one 
15 of the six measurements 1-6 we can se that the edge is within 
the range of 8 - 10 measurement clocks. The accuracy is in 
other words 2*tm. Using above scaling expression this can be 
expressed in nanometers too. 

In the following some realistic numbers are introduced. 
20 tm= (1/40) = 25 ns . 

tp= (1/46,7) = 21.413 ns. 
pp=250 nm. 

This results in that the pm= 291.86 nm. 



25 



If we now count measurement clock ticks by resetting a counter 
by the reference signal we see that we only will count 8 or 9 
ticks. No other count is possible in this example. The edge 
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position relative the phase of the measurement clock will in 
this way be rectangular distributed inside tm. The average 
position can therefore be calculated just by adding counts 
from several measurements together and divide this number with 
5 number of measurements. In this example we get (8+8+8+8+9+9) /6 
= 8.33 counts as an average value. So an estimation of the 
position of the edge can be calculated to be: 

8.33 x 291.86 = 2432 nm. 

Now it is not enough just to use 6 measurements as in this 
10 example. Normally you use several thousands of measurements. 

(In the detailed description, the three sigma of the average 
value is described from a theoretical point of view.) 

Summary of the invention 

An object with the present invention is to provide a method 
15 for determining coordinates, especially in two dimensions, in 
a deflector system using any kind of pattern. 

A solution is achieved in the features as defined in claim 1 
and 10 . 

Another object with the present invention is also to provide 
2 0 software for performing the method, which is provided in the 
features defined in claim 14. 

An advantage with the present invention s that it is possible 
to generate an image of the pattern without using any other 
detection method than the one we already are using today, 
25 since the present invention is similar to the prior art 
method, except that it is 90 degrees rotated. 
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Another advantage is that no new hardware is needed since the 
present invention is implemented in software. 

Brief description of the drawings 

Fig. 1 illustrates the method for measuring time in the same 
direction as the micro sweep according to prior art. 

Fig. 2 shows an image of the star-mark that could be used for 
measuring time and position according to the invention. 

Fig. 3 shows an enlargement of a part of the image in fig. 2. 

Fig. 4 illustrates the principal measuring technique of a 
horizontal bar according to the invention. 

Fig. 5 illustrates the principal measuring technique of a 
vertical bar according to the invention. 

Fig. 6 illustrates the preferred method to obtain X-coordinate 
using a sweep in Y-direction according to the invention. 

Fig. 7 shows an image obtained by using the method according 
to the present invention. 

Fig. 8 shows an enlargement of the image in fig. 7. 

Fig. 9 shows cursors applied to the image presented in fig. 7 

Figs. 10 and 10b show expanded views of the cursors in fig. 9. 

Fig. 11 shows a graph illustrating the average speed for a 
measurement . 

Fig. 12 illustrates the statistic principle behind random 
phase measurement as is used in a preferred embodiment 
according to the invention. 
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Detailed description of the preferred embodiments 

So far we only have used this method to measure along the 
micro sweep i.e. in one dimension. It is though possible to 
extend the method to measure in two dimensions. When we do 
5 this we actually are generating images of the pattern we 
measure . 

When we talk about images we normally see this as a set of 
pixels. (Each pixel has a certain "gray-level" that describes 
the intensity of the pixel) . 

10 When handling CCD images each pixel is fixed in position in a 
certain raster (or grid) . When analyzing a CCD image for 
finding the position of an edge both information of the 
pixel's location and gray-level must be used. Different 
straightforward methods may be used for estimating an edge 

15 position in the image. The accuracy of the position estimation 
depends in the calibration of the CCD array i.e. where the 
pixels are located in the array, how sensible they are for 
light and how well we can place the image on the array without 
any distortions. Light distribution over the CCD and different 

20 kinds of optical distortions will contribute to the error of 

the position estimation. A lot of these errors can be overcome 
if we calibrate the measurement system against a known 
reference . 

When using the method according to the invention we also refer 
25 to pixels. But our pixels are not fixed in location in a 

certain grid. If we make a "snap shot" of the pattern by just 
measuring it once we will get information with a quite rough 
resolution (or accuracy) . It is important to realize that the 
only information we are using is the pixels location. We do 
30 not use any gray-level information at all. Of course it is 
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possible also to use gray- level information by recording the 
pattern using different "trig" levels in the hardware. This is 
what we do if we are interested in beam-shapes as in focus 
measurements. Here we only are interested in measuring the 
5 location of one or several bars so we can calculate center of 
gravity and CD. 

When measuring registration and CD we never are interested in 
the exact location of one single pixel. Normally we only are 
interested in the average of several pixels location. In a CD 

10 measurement we use cursors to define number of pixels to be 
used in this average value. Also in the center of gravity 
estimation we use cursors to "even out" noise from the edge. 
This noise might be roughness from the pattern itself or noise 
in the measurement system. This is the same when using a CCD 

15 image as input. 

In this suggested method we use the micro sweep itself as our 
light source (or ruler) . It is hard to find a more accurate 
ruler than this. We already have methods to calibrate this 
ruler both in power and linearity very accurately. 

2 0 In figure 2 we have captured an image 20 of a part of our 

star-mark. The image shows the location of pixels 21 in a grid 
of (316x250) nm. Nothing more than just showing the pixels in 
this grid has been done in the image. The image 20 shows so 
called events in the area. The mark has been scanned with a 

25 hardware cursor of 100 urn. The positive going edges 22 are 
shown as white pixels and negative going edges 23 (chrome- 
glass transitions) are shown as black pixels. Just by 
observing this image you can see that the mark is slightly 
rotated counter clock vice. The number of black pixels in the 
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lower Y-parallel bars 24 compared to whites ones is a clear 
indication of this fact. 

In order to demonstrate the actual grid we are using and how 
the pixels are distributed in this grid we refer to figure 3. 

5 Here we have enlarged a part of the image 20. This "hard copy" 
of the image shows clearly where we have found events. The 
method to "sharpen" up this image will be presented below. The 
scale in this image is correct in that sense that one pixel is 
316 nm in X-direction (vertical scale) and 250 nm in Y- 
10 direction (horizontal scale) . 

Estimation of the X-coordinate 

As has been described in the background to the invention, 
there exists a very accurate method to estimate the Y- 
coordinate of an event. The micro sweep is used as a ruler and 

15 a measuring clock that is random in phase relative the ruler. 
The measurement clock will give us a rough resolution of tm 
(292 nm) in a single shot measurement. If we use several 
measurements and build us an average value we will get a much 
higher resolution (see below) . Actually we can choose the 

20 accuracy just be selecting number of measurements and the 

length of the cursor to be used. So far this is true for the 
estimation of the Y-coordinate. The problem is how do we do to 
estimate the X-coordinate? 

Obviously it is difficult to believe that it is possible to 
25 get an X-value out from data retrieved by a scanning a beam in 
Y-direction. The big step forwards is that it actually is 
possible to retrieve this information almost with the same 
accuracy as the Y-coordinate. But to get it we must introduce 
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another signal (that actually already is used in the system) , 
the lambda/2 X-signal. 

In the prior art, when measuring a 45-degree bar of a pattern 
as in the star-mark case, we use the X-lamda/2 signal as 
5 "marks" in X-direction to define an X-cursor. Inside the 

cursor we also record the lamda/2 signal simultaneously when 
we count the measurement clocks. But since we measure on a 45- 
degree bar we actually are using only Y-inf ormat ion to get the 
X-coordinate . In combination with the lambda/2 information we 

10 can calculate the X-coordinate with a very high accuracy. The 
drawback of this method is of course that we are not able to 
measure on any kind of pattern. Especially we cannot measure 
on a bar that is parallel with the ruler. If we extend the 
method we already are using in Y-direction a little bit, we 

15 will soon realize that the problem to solve is exactly the 

same as we have in Y-direction but rotated 90 degrees. If we 
change our measurement clock to our reference signal (here the 
SOS - Start Of Sweep) and use the lamda/2 signal as reference 
instead we have rotated the problem 90 degrees. 

20 When doing this "rotation" of the problem we need to re- 
calculate our parameters. In Y-direction our resolution was 
one measurement clock that corresponded to 2 92 nm. During one 
run over the pattern of interest we scanned it with a 
frequency of approximately 30 kHz. The question now is how far 

25 we move in X-direction between the scans. If we set the speed 
as low as possible we will retrieve about 8-10 scans of the 
pattern in each lambda/2 period. Since one lambda/2 period 
corresponds to 316 nm we have a resolution in the range of 30- 
40 nm in X-direction. This is because we scan the pattern with 

3 0 the frequency of 3 0 kHz during the movement in X-direction. 
Now when we use the lamda/2 signal as the reference we 
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therefore have a "clock" with a spatial resolution of 30-40 nm 
in X-direction. This is significantly higher than the 
resolution in Y-direction. But, and this is important, we will 
not get as many samples in X-direction as in Y because of the 
5 movement in X. This fact is illustrated in figure 4. 

The situation in X-direction is shown in figure 4. A bar 40 is 
scanned in one stroke (run) and generates one event only in 
the six scans 41. So when moving over the bar one time we know 
the position of the bar with an accuracy of +/- 40 nm in the X 

10 direction. The Y-coordinate of the bar location is known with 
the accuracy of + /- 292 nm (in each of the two edges 42 and 
43) . In figure 4, the CD in X-direction of the bar is lower 
than the 40 nm measurement grid we use in X-direction. So. just 
running one time over the pattern might miss the fact that 

15 there is a bar at all. 

This is natural since the resolution is lower than the CD of 
the bar to be measured. In order to measure the bar with 
higher resolution you need to do several runs over the pattern 
with random phase. 

20 A comparison of the situation in Y-direction is illustrated in 
figure 5. Here you are scanning a bar 50 with the same length 
in Y-direction. The resolution in Y-direction in each scan 51 
is 2 92 nm but you retrieve 7 scans over the same length of the 
bar . 

25 If we separate the problem we can say that in one scan we can 
resolve a pixel with the resolution 40 nm in X-direction and 
290 nm in Y-direction. 



The algorithm. 
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So far we have described the main principle in Y and X 
direction. We have rotated the problem in Y 90 degrees to X. 
In Y-direction we have two processes that are random relative 
each other, the measurement clock and the SOS (or any 
5 correlated signal to SOS) . In X-direction the measurement 

clock corresponds to the SOS signal and the reference is the 
lamda/2 signal. Also these signals (or processes) are un- 
corrected. We have different resolution in the different 
directions but it turns out that the accuracy is almost the 
10 same. 

In figure 6, the principle of how to get the X-coordinate of a 
bar 60 is described. In figure 6 we see a bar 60 that is 
parallel with the ruler and micro sweep direction. The 
reference signal is the lambda/2 positions. In each lamda/2 

15 interval we scan the pattern 61 with the micro sweep (our 
ruler) . The movement over the pattern in X direction is 
performed with a much lower speed compared to the speed used 
when exposing a pattern. In this example we get about 8 scans 
61 in the lamda/2 interval. If we now start to count SOS in 

20 the interval we will have a very similar situation as 

described above. If we count the total number of scans in the 
interval this will be a measurement of the speed in the 
interval. Since we cannot assume that the speed is the same in 
all the intervals it is important to do this speed calculation 

25 in order to get the correct "weight" of an event in the 

interval in X-direction. In this example we will get a Y-event 
(a transition from glass to chrome) when we have counted two 
SOS in the first run 62, three in the second 63 and so on. So 
just adding the w index" of the event inside an interval and 

30 divide this number with total number of SOS in the interval 
will give us an estimation of the X-coordinate of an event 
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inside a certain interval. Above we will get the approximate 
position of the first Y-event after three runs over the mark 
to be (2+3+2) /3 = 2.3 SOS tics in the interval. To calculate 
what this corresponds to in nanometers we just multiply this 
5 number with the local resolution. 

Here we get 2.3 *316/8 = 92.2 nm. This is the local coordinate 
64 for the edge of the bar 60 in the first interval. The local 
resolution depends on the speed, i.e. total number of SOS in 
the interval. If we can run the system more slowly this 
10 resolution will be better. But you will also gain resolution 
by scanning the bar in several runs. Below, the accuracy of 
the average position estimation is discussed. 

As can be seen from above discussion we actually can calculate 
the X- coordinate from data retrieved from a scanning sweep in 

15 Y-direction. What we do is using the fact that we know exactly 
where we are in X-direction every time we pass an interval 
border 65. Inside an interval we only must assume that the 
speed is constant. This of course does not mean that the speed 
needs to be constant over all intervals. In practice we run 

20 several times across the pattern in both directions and record 
the Y-events and lamda/2 positions simultaneously. We 
therefore have the possibility to calculate the local speed 
with high accuracy by using information from all the runs. 

The method described above is suitable to be used in either a 
25 laser lithography system or an e-beam lithography system. 

Filtering 

What we really are after is not the exact position of an 
individual pixel . The discussion so far has lead us to that 
the position accuracy of a single pixel depends of how many 
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times we have recorded the pattern and the resolution we use 
during the recording. If we scan the pattern a certain number 
of times we can "select" the accuracy we want before hand. 
This can be done since we have full control over the 
5 measurement process. When we do this "accuracy" selection we 
also must consider our cursors. As have been mentioned before 
a cursor is just another way to define number of pixels to use 
for calculating an average value. 

There are many ways to apply a filter to this kind of data. An 

10 obvious way might be to fit a line using standard regression 
techniques . These techniques works but does not generates the 
optimum result in this case. The main reason is that the pixel 
data we handle does not describe a Gaussian distribution. We 
have a more or less rectangular distribution to deal with. 

15 When using a regression technique we therefore will "over 

weight" pixels close to the border of a lamda/2 interval or 
the tm interval in the Y-case. A much better method to use is 
the more simple "area" estimation method. This method is also 
more accurate for this kind of data compared to the regression 

20 technique. To fit a line to an edge you just divide the 

database in two half's. In this case the data you have is x,y 
coordinates. You calculate the average value of all 
coordinates in each half. This way you will get two x,y 
points. These two points describes the line to be used in 

25 further calculations. 

Some real results 

In figure 7 we have filtered the data using the above 
described algorithm. So far we have not applied any cursors. 
Only the average locations of the pixels have been calculated. 
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The image 7 0 shown has been built from four runs over the 
mark . 

The small square 71 in the image 70 is enlarged in figure 8. 
Here we have used the algorithm and some filtering in order to 
5 "sharpen up" the data. Each pixel in this image is a result of 
all four runs over the pattern. 

Cursors 

We now will apply cursors to the data in order to measure the 
CD and center of gravity position of the cross. The center of 
10 gravity of the cross is measured using four cursor pairs. 
These cursors are shown in figure 9. 

Each line 90, 91 of the cursors is calculated based on the 
data from the edge in the cross. The line is calculated by 
using the simple "area" estimation method described above. 

15 In figures 10a and 10b, a part of an X and Y bar is expanded. 

Figure 10a shows a part of the upper left edge. The calculated 
cursor is an accurate estimation of the position of the edge 
in X-direction. 

Figure 10b is a part of the upper right edge of the cross. The 
20 position of this line 91 defines the edge position in Y- 
direction. 

The reason for the mixture of white and black pixels along the 
Y-bar in figure 10a can be explained. The hardware has, in 
this example, a limitation in that it can only re-trig on an 
25 event after two clock periods of the measurement clock. This 
means that if we have a positive and negative transition 
inside this time period we will miss one of the events. This 
is one of the reasons that the pixels are a bit spread in Y- 
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direction. Then because of the noise the hardware will trig 
randomly on a negative or positive transition. This is 
actually no problem due to the fact that if we have a positive 
or negative transition is not so important information. What 
counts is where the transition occurs. To know the "direction" 
of the edge we can use several transitions or other kinds of 
logic decisions to know which type of transition we have. 

In below table the center of gravity and the CD is presented 
for the cursors. Below table shows the result of the four 

File: f d f 06Q2 105508. 3d Hw cursor: 99.22 urn 



Y-cursor 0 



X0 (urn) YO (urn) 


XI (urn) 


Yl(um) 


Centre (um) 


CD(um) 


38.843 


53.380 




46.111 


14.537 


Y-cursor 1 










X0(um) Y0(um) 


XI (urn) 


Yl (urn) 


Centre (um) 


CD(um) 


53.348 


3B.789 




46.069 


14.558 


X-cursor 2 










X0(um) Y0(um) 


XI (urn) 


Yl(um) 


Centre (um) 


CD (um) 


100.648 




115.106 


107.877 


14.458 


X-cursor 3 










X0(um) Y0(um) 


XI (urn) 


Yl(um) 


Centre (um) 


CD (um) 


100.690 




115.128 


107.909 


14.439 



cursor pairs separately. 

The center position of the mark (Xcenter , Ycenter ) may be 
calculated as the average value of the Y-cursor center values 
(Xcenter) and the X-cursor values (Ycenter) . 

Second order effects. 

So far we have discussed the main principles of the algorithm. 
We will now discuss two vital corrections that must be done on 
the data that are second order effects from the method. 

First we need to correct for an eventual azimuth angle in the 
data. If we use a writer (as done in this case) we have a pre- 
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misalignment between the X-movement direction and the ruler 
This angle a can be expressed as: 



a = a tan 



f \ 
vx 



Where vx is the exposure speed of the system and vy is the 
5 speed of the micro sweep. 

This angle calculation can be reduced to the expression: 

Number of beams 
a- ~ 

Sos _ rate 

Where the Sos _rate is total number of pixel clock periods 
between two SOS. (See below for a more thorough explanation). 

10 Another effect that must be taken care of is the effect of the 
X-movement during a measurement. Also here we will introduce 
an "azimuth" error. Even if we run the same number of positive 
strokes and negative strokes we will not cancel out this error 
completely. The reason is that this error has to do with the 

15 difference in speed for a positive and negative stroke. For a 
stroke in one direction we will therefore get an error that 
may be expressed as an angle (j3) . 

This angle can be expressed as: 
_ xlnc 1 



speed (Sos _ rate • yPix) 



20 where xJnc is lambda/2 [nm] and speed is total number of start 
of sweeps inside the xJnc interval. If we divide p with a we 
will get a relation between the angles. 



WO 2005/073668 PCT/SE2005/000100 

17 

P Xlnc 1 

a speed (nbeams • j>Pjjc) 

If we put in some realistic numbers, xlnc = 316 nm, Speed = 8 
Sos/interval , nbeams= 9 beams and yPix = 250 nm, we get: 

5 Z = 2I5 ^ = 0.0175 

a 8 (9 • 250) 

If we calculate the error generated by a on a distance of 100 
urn we will get : 

alpha_error = 100 * 9 / 1435 = 0.6272 {im. (The Sos__rate is 
taken from TFT3 system parameters) . Since the (3 = 0.0175 * a 
10 we can calculate the error generated by the fact that we are 
moving during measurement to be : 

0.0175 * 627.2 [nm] - 11 nm. This is a quite large error that 
cannot be neglected. This error will change sign depending of 
the direction of the measurement. If we measure during the 
15 same number of positive and negative strokes and the local 
speed is the same for both strokes this error will be 
cancelled out completely. In practice this is not the case. We 
will therefore get a small net-error due to this fact. 

In the graph shown in figure 11, the average speed is 
2 0 presented for a measurement. 2 Forward strokes and 2 backward 
strokes were used. The hardware cursor was 9 9.22 urn (314 
lamda/2 intervals) . As can be seen there is a significantly 
difference in local speed for the forward and backward stroke. 

Random phase measurement 

25 When using a random clock for measurement we shall see this as 
a statistical problem. In figure 12 the measurement situation 
is illustrated. What we want to measure is the time tp that is 
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the difference between tl and to. The signal is synchronized 
with the reference signal. 

We re-write the time tp as: 
' tp = (k+d) *tm 

5 Where k is an integer number and d is the decimal part of tin. 
If we do this d will be a number in the interval [0 , 1[. It 
will be shown later why this is a reasonable expression to use 
for tp. 

We now introduce the measurement clock with a phase that is 
10 random relative the reference signal. We also introduce a 

counter that counts the positive going flanks of this clock. 
If we reset this counter with the reference signal we realize 
that we sometimes will count k flanks and some times k+1 
flanks. No other counts are possible. We introduce the 
15 discrete stochastic variable K that in this way can get two 
values k and k+1. 

We now look in figure 12 and introduce another stochastic 

variable, A which describes the phase of the clock relative 

the reference signal. A sample point of A (oc ) will be a 

20 number in the interval [0 y. 1[. A is a continuous stochastic 
variable . 

In figure 12 we can see the following important facts: 

If oc > d then the sample point of K will be k. 
If oc < d then the sample point of K will be k+1 

25 What we now must do is to calculate to probability for the 

sample point k and To do this we must use the frequency 
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function shown in figure 12 above. Since all phases have the 
same probability this is a rectangular distribution function. 

We have : 

d 

P k+X = Jf(*)* = </, (oc<d) 

0 

5 P k =l~d (a>d) 

So the probability that we get the sample point Jc+1 out from K 
will be d and the probability that we get the sample point k 
out of K is (1-d) . 

< 

When we add the clock counts for each measurement and then 
10 divide with n we actually is estimating the average value for 
the stochastic variable K. 

The estimated mean value may be expressed as: 

-a 

Here we have only two possible sample points so we get: 
15 E(K) = k(\-d) + (k + \)-d = k + d 

So when we rescale this result to nanoseconds we get 
(k + d)-tm=tp . 

This result proves that building the average value of the 
counter tics and scale this value with tm will give us the 
20 time we are after. 

The sigma 
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To calculate the accuracy of the average value E(K) we need to 
find the variance of K. 

The variance of a distribution may be expressed as: 
*W = J(*-ai) 2 p{k) (1) 

-a 

5 This can be re-written as: 

V(K) = E(K) 2 + [E(K)f 
We get : 

V(K) = k 2 -(l-d) + (l + k) 2 -d-(k + d) 2 = d(l-d) 
and 

1 0 D(K) = sigma = ^d-{\-d) 

The variance function is actually very interesting. We see 
that if d=0, that means that we have no decimal part V(K)=0 we 
also see that if d is very close to 1, V(K)=0. Actually the 
variance has its maximum when d=0.5. In this case the variance 
15 is 0.25. The sigma will therefore be 0.5 as its maximum. 

To interpret this you may think as follows. If d is 0 we 
always will count k ticks from the counter. Here we also 
assume that we count one tick if the positive going edge from 
the clock coincides with the reference signal. Since we always 
20 is counting k ticks independently of the phase of the 

measurement clock the spread also from the average value will 
be zero since variance is a measurement of the squared 
distance from the estimated average value. (Please refer to 
equation 1 above) . 
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What is then the physical meaning of this? 
Let us first make a practical example. 

If we measure a signal with the decimal part 0.01 and k=2 the 
probability of counting a 3 in a measurement will be 0.01. 
5 This probability is the same for each measurement. Now if we 

calculate the average of 100 measurements we will probably add 
99 samples of 2 and one sample of 3 (Case 1) . But it is also 
possible that we add 100 samples of 2 and no samples of 3 
(Case 2) . The error we actually have in the average value is 
10 then: 

01 lfl^ = 0£i = 0.0Q9 

Vioo Vioo 

So after 100 measurements in case 1 we will get: 
99-2 + 1-3 201 

- = — = 2.01 ±0.005 , and in case 2: 2.00 +/- 0.005 
100 100 

There is another very interesting way to see the physical 
15 conclusion of the case when d=0. 

Assume that we want to measure a signal that is exactly k*tm. 
In this case the decimal part is zero. Now if we add counter 
ticks we must always count k ticks. Otherwise, and this is 
important, we should never get the correct average that is k 

20 in this case. In other words we cannot ever count k+1 ticks. 
If this would be the case the average we calculate would not 
be k. For this reasons the variance must be zero. Please note 
that only two numbers can generally be counted, k and k+1 . So 
the value k-1 can never be counted. So in other words a count 

25 that is Jc+1 cannot be compensated by a value k-1 so we get the 
correct average anyway. 
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Since we do not know tp beforehand we should use the worst - 
case scenario when we estimate the error. In other words we 
shall say that the error due to the method is: 

Error (K) = 0.5 * tm [ns] . 

5 This is as shown above the maximum of the function d* (1-d) . If 
we want to use a symmetrical error instead we can express the 
method result as: 

tp = ((k + d)±025)tm[ns] 

The error in the method will go down if we use a large number 
10 of measurements. We can express the error as: 

measurement _ error = -~ • 0.5 tm [ns] 

This expression can be scaled to nanometers as: 
Error[nm] = -j= • 0.5 ■ rs [nm] , 

where rs is the actual resolution for the actual direction. If 
15 we put in some numbers, rs=2 91 nm in Y-direction and rs=4 0 

(316/8) nm in X-direction. So the error in the estimation of a 
pixel position in X or Y direction may be approximated to be: 



4~n 



y Error =—'145 [/fill] 



4n 



X Error =— ' 20 



20 The azimuth angle 
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In figure 13 the exposure case is illustrated. Between two 
start of sweeps we moves the distance nbeams * dy [um] in X 
direction, dy is the pixel size. We here assume a square 
pixel. In the same time we move N * dy [um] in Y direction 

The angle alpha (a) may be expressed as atan (vx/vy) . If we 
calculate this angle we get: 



a = a tan 



f nbeams * dy/sos _ time^ 
^dyj pixel clock _time j 



The sos_time may be expressed as N * pixel_clock_time . N is 
here the total number of pixels between two start of sweeps. 
10 Finally we therefore can express the angle alpha (a) as: 



( nbeams^ 

a - a tan 



N 



J 



Please note that this angle is a constant "compensation" that 
preferably is removed from the database. 



